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Abstract. The distribution of trade sizes and trading volumes are investigated based on the limit order book data of 
22 liquid Chinese stocks listed on the Shenzhen Stock Exchange in the whole year 2003. We observe that the size 
distribution of trades for individual stocks exhibits jumps, which is caused by the number preference of traders when 
placing orders. We analyze the applicability of the "g-Gamma" function for fitting the distribution by the Cramer-von 
Mises criterion. The empirical PDFs of trading volumes at different timescales At ranging from 1 min to 240 min can 
be well modeled. The applicability of the g-Gamma functions for multiple trades is restricted to the transaction numbers 
An ^ 8. We find that all the PDFs have power-law tails for large volumes. Using careful estimation of the average tail 
exponents a of the distribution of trade sizes and trading volumes, we get q > 2, well outside the Levy regime. 

PACS. 89.65.Gh Economics; econophysics, financial markets, business and management - 89. 75. Da Systems obeying 
scaling laws - 89.75.-k Complex systems 



1 Introduction 

A well-known adage says: It takes volume to move stock prices, 
indicating that the trade size and trading volume contain much 
information about the dynamics of price formation. The topic 
of price-volume relationship has a long history in finance lit] 
and recently has been investigated at the transaction level 12, 
[HHIllI^- Furthermore, understanding the origin of power-law 
tails in returns is an important issue. In the unified theory of 
Gabaix et al 10], the power-law tails of returns are related to 
the power-law tails of volumes, where the institutional activ- 
ity plays a crucial role |@, |^ [lOll . Zhou verified the relation 
at the transaction level, although the values of exponents are 
remarkably different from those of Gabaix ef aZ ja]. On the 
other hand. Farmer et al have found that large price changes at 
the transaction level are driven by liquidity fluctuations rather 
than by volume lITlll . In particular, they showed that the re- 
turn distribution is closely related to the distribution of gaps 
between the first two price levels on the order book. Webber 
and Rosenow argued that large stock price fluctuations can be 
explained quantitatively by taking into account both the order 



flow and the liquidity Ulil . In addition, Joulin et al also found 
that most price jumps are induced by order flow fluctuations 
close to the point of vanishing liquidity and volume plays a mi- 
nor role uM- In any case, the distributions of trading volume 
are of interest, also as part of stylized facts. 

The issue of the tail distribution of trade sizes is also con- 
troversial. The probability density function of trade sizes has a 
fat tail often described by a power law: 



f{v) (X V' 



(1) 



Gopikrishnan et al analyzed the transaction data for the largest 
1000 stocks traded on the three major US markets and found 
that the distribution of trade sizes follows a power-law tail with 
the exponent being 1.53±0.07, and called this as the half-cubic 
law |14, 15. 161. The aggregated trading volumes at timescales 
from a few minutes to several hundred minutes were found to 
have a power-law tail exponent 1.7 ±0.1 |[ll,[Tl,[il]El Maslov 
and Mills also found that the trade sizes of several NASDAQ 
stocks have power-law tails with the exponent close to 1 .4 ifisll . 
Plerou and Stanley extended this analysis to other two markets 
(LSE and Paris Bourse) and found quantitatively similar results 
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' It is noteworthy to point out that Farmer and Lillo investigated 
three LSE stocks and found no clear evidence for power-law tails ITtIi . 
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across the three distinct markets u9\\ . All these tail exponents 
were found to be well below 2, within the Levy regime. 

Alternatively, Eisler and Kertesz reported that the tail ex- 
ponents of the traded volumes at a timescale of 15 minutes for 
six NYSE stocks are 2.2 and 2.8 ll20l I21I1 . Racz, Eisler and 
Kertesz argued that the tail exponents of the trade size were 
underestimated ll22ll . They investigated the 1000 most liquid 
stocks traded on the NYSE for the same period as studied by 
Plerou and Stanley |[l9ll and found that the average tail expo- 
nent is 2. 02 ±0.45. The tail exponents were found to be outside 
the Levy regime for stocks in the Korean stocks 1I23I1 and the 
Chinese stocks 16]. 

The above mentioned studies concern with the tail behavior 
of the trade size or trading volume distribution. There are also 
efforts attempting to describe the whole distribution of trading 
volumes. The normalized trading volumes of 10 top-volume 
NYSE stocks and NASDAQ stocks at different timescales of 1 
min, 2 min and 3 min were fitted by the g-Gamma distribution 

il 



event-time interval An: 



1 /wx/S 



where 



Z = 



dv 



(2) 



(3) 



is the normalization constant, 9 and (3 are positive parameters, 
and the value of q is larger than 1 . The usage of the g-Gamma 
distribution can be motivated from a stochastic dynamical sce- 
nario ll25l I26I I27I1 . In the limiting case q I, the traditional 
Gamma probability density function is recovered. The asymp- 
totic behavior of the g-Gamma distribution has a power-law 
form (Eq. ([T]i) with 

f,G{v)^v'^^'-'^^+^. (4) 
Compared with Eq. ([T]i, we find the asymptotic tail exponent 



g-1 



(5) 



Here, we use a to make the difference between the empirical 
tail exponent and the fitting one. From the results of reference 
ll25ll . we find the tail exponent a is greater than 2 for 10 high- 
volume NASDAQ stocks (for which [3 and q are available) at 
different timescales of 1 min and 2 min. 

In this paper, we focus on investigating the distributions 
of trade sizes and trading volumes, based on limit order book 
data of 22 liquid stocks. The trade size uj = uji is defined as 
the number of shares exchanged in trade i, and trading volume 
is defined as the share volumes in a fixed time interval. Here 
we consider two types of trading volumes based on different 
definitions of the time interval, event time and clock time. The 
first one is defined as the total volume traded in a fixed clock- 
time interval At: 

NAt 

^At ^ , (6) 

1=1 

where Nai is the number of transactions in a fixed interval At. 
The second one is defined as the total volume traded in a fixed 



An 



^An — , 



(7) 



where An is the number of trades. The issue of trade sizes is a 
special case when An = 1. Our work deals with the problem 
of describing the whole distributions and with the estimation of 
the tail exponent. 

The paper is organized as follows. After a brief description 
of the data in Section |2] we investigate the distributions of the 
trade sizes (uj) and the two types of trading volumes {f2/i,t and 
fiAn) in Sections[3]|4]and|5] respectively. Section|6]concludes. 



2 Description of the data 

We used tick by tick data for 22 liquid stocks traded on the 
Shenzhen Stock Exchange (SZSE) in the whole year 2003. 
The market consists of three time periods on each trading day, 
namely, the opening call action (9:15 AM to 9:25 AM), the 
cooling period (9:25 AM to 9:30 AM), and the continuous dou- 
ble auction (9:30 AM to 1 1:30 AM and 1:00 PM to 3:00 PM). 
In this paper, we consider only the transactions occurring in 
the double continuous auction. The size of each transaction is 
recorded in the data sets. In the Chinese stock market, the size 
of a buy order is limited to a board lot of 100 shares or an inte- 
ger multiple thereof, while a seller can place a sell order with 
any size. The recorded trade size is in units of shares. 

The tickers of the 22 stocks investigated are the follow- 
ing: 000001 (Shenzhen Development Bank Co. Ltd: 887,741 
trades), 000002 (China Vanke Co. Ltd: 509,360 trades), 000009 
(China Baoan Group Co. Ltd: 447,660 trades), 000012 (CSG 
holding Co. Ltd: 290,148 trades), 000016 (Konka Group Co. 
Ltd: 188,526 trades), 000021 (Shenzhen Kaifa Technology Co. 
Ltd: 411,326 trades), 000024 (China Merchants Property De- 
velopment Co. Ltd: 133,586 trades), 000027 (Shenzhen Energy 
Investment Co. Ltd: 313,057 trades), 000063 (ZTE Corpora- 
tion, 265,450 trades), 000066 (Great Wall Technology Co. Ltd: 
277,262 trades), 000088 (Shenzhen Yan Tian Port Holdings 
Co. Ltd: 97,195 trades), 000089 (Shenzhen Airport Co. Ltd: 
189,1 17 trades), 000429 (Jiangxi Ganyue Expressway Co. Ltd: 
117,424 trades), 000488 (Shandong Chenming Paper Group 
Co. Ltd: 120,097 trades), 000539 (Guangdong Electric Power 
Development Co. Ltd: 114,721 trades), 000541 (Foshan Elec- 
trical and Lighting Co. Ltd: 68,737 trades), 000550 (Jiangling 
Motors Co. Ltd: 346,176 trades), 000581 (Weifu High-Technology 
Co. Ltd: 93,947 trades), 000625 (Chongqing Changan Auto- 
mobile Co. Ltd: 397,393 trades), 000709 (Tangshan Iron and 
Steel Co. Ltd: 207,756 trades), 000720 (Shandong Luneng Tais- 
han Cable Co. Ltd: 132,233 trades), and 000778 (Xinxing Duc- 
tile Iron Pipes Co. Ltd: 157,321 trades). 

The 22 stocks investigated in this work cover a variety of 
industry sectors such as financials, real estate, conglomerates, 
metals & nonmetals, electronics, utilities, IT, transportation, 
petrochemicals, paper & printing and manufacturing. Our sam- 
ple stocks were part of the 40 constituent stocks included in the 
Shenshen Stock Exchange Component Index in 2003 ||6tl. 

Note that when we investigate trading volumes ^2At and 
f^An of all 22 stocks, we have normalized the trade sizes oj 
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Fig. 1. (Color online) (a) Empirical probability density function /(c 
probability density distributions f{uj/{ijj)) of the normalized trade sizei 
total number of outstanding shares to account for share splits, and (tj) i 
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a)/<Q)) 

) of the trade sizes of a typical stock (code 000001). (b) Empirical 
of 22 liquid stocks. Here, we have normalized the trade sizes u by the 
the mean trade size for individual stocks. 



by the total number of outstanding shares to account for share 
splits. 



3 Trade size distribution 

3.1 Trade size distribution of individual stocks 

In this section, we study the trade size distribution of indi- 
vidual stocks. Fig. [ija) displays the probability density func- 
tion (PDF) f{Lo) of a typical stock (Shenzhen Development 
Bank Co., LTD, code 000001) and Fig.Eb) shows the PDFs 
f {oj / (uj)) of the normalized trade sizes of 22 stocks, where (uj) 
is the mean trade size for individual stocks. The most intrigu- 
ing feature is that the PDFs are not continuous and there are 
evident jumps in the curves. The outliers seem rather equidis- 
tant on the log scale. The reason is that the traders seem to have 
a sense of number preference, which will be discussed in de- 
tail in Section 13.21 Therefore, the group of 22 PDF curves in 
Fig HIb) looks very thick. In the previous work, the cumulative 
distribution of trade sizes is investigated. In this way, the jumps 
are smoothened out. We note that this phenomenon disappears 
for trading volumes. 

3.2 Number preference 

Due to a variety of diverse natural and human factors, the num- 
bers' frequency of occurrence in human world shows an un- 
even behavior With the help of search engines, Dorogovtsev 
et al found the occurrence frequencies of numbers in the World 
Wide Web pages are very different, 777 and 1000, for example, 
occur much more frequently than their neighbors ll28ll . This sit- 
uation also happens in the stock market. The order price place- 
ment shows irrational preference of some numbers like 5, 10 
or their multiples ll29ll in Chinese stock market. The first ob- 
servation can trace back to Niederhoffer Isoll in 1965, showing 
the price are often on integers, then on halves, on quarters, and 
Harris did more works on it |31]. Moreover, the frequency of 
first digit in stock's price or return does not correspond to the 
frequency of i for each digit from 1 to 9 ll32i[33ll . which also re- 
flects people's number preference. Similar phenomenon exists 



in the number of trades, as shown in Fig. |2] This is accordant to 
the result of Alexander and Peterson ll34ll . Fig.|2]plots the num- 
ber of transactions with the same trade size as a function of the 
trade size for stock 000001 when the trade sizes are among 1 to 
10^ shares. It is observed that there are several layers of spikes 
in these plots. Fig.|2ta) shows the first-layer spikes locating at 
ijj = lO'^fc and the second-layer spikes at a; = 10'*(fc + 0.5), 
where k — 1,2, •• • ,9. The third-layer spikes at = 10^ A; 
and the fourth-layer spikes at a; = 10'^(fc + 0.5) can been seen 
in Fig.|2jb). Fig.|2jc) and (d) depict more layers of spikes for 
smaller trades. These spikes explain the jumps in the trade size 
PDFs in Fig.[T] Comparing Fig.|2d) with other three plots, we 
find that the magnitude of N for uj < 100 is much lower than 
those for large trades. 





100 200 300 400 500 600 700 800 900 1000 10 20 30 40 50 60 70 80 90 100 

CO (0 



Fig. 2. The number of transactions with the same trade size as a func- 
tion of the trade size for stock 000001. 
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3.3 Fitting the distribution 

We now fit the PDF of the normalized trade sizes v = lo/ (uj) 
of all the 22 stocks. The g-Gamma distribution (Eq. (|2])) is ap- 
plied to model the PDF. For comparison, we also adopt the q- 
exponential distribution fq{v) ll35ll36l [3^ 
ft(f) ll38ll and log-normal distribution fi 
as follows: 



student distribution 
, which are defined 



where 9 > and q > 1, 



TDf 1 n - 



H{v 



-{n+l)/2 



H 



(8) 



(9) 



where _B(-, •) is the "beta function", location parameter x, scale 
parameter H and degrees of freedom parameter n are positive, 
and 



1 



exp 



— (Inw — /i) 



20-2 



(10) 



where /i and a are corresponding mean and standard devia- 
tion respectively. The (jf-exponential (Eq. ([8])) has an asymptotic 
power-law tail whose tail exponent is 



- 1 



(11) 



The empirical PDF and two fits { fqc and fq) are illustrated 
in Fig. [3] because student and log-normal fits deviate consider- 
ably from the empirical PDF, especially the student function. It 
is visible that (/-exponential gives bad fit for small values, but 
a better fit than g-Gamma for large values. Using taboo search 
and least-squares estimator, we obtain that the parameters of 
the g-Gamma are 9 = 0.07, /? = 1.52, q = 1.22 and x = 0-57, 
while that of the q-exponential are 9 = 0.7, q = 1.45 and 
X = 0.74. According to Eq. Q and Eq. (fTTl i. the tail exponents 
a in the two models are 2.03 and 2.22, respectively. 

It is clear from Fig.|3]that the g-Gamma distribution fits re- 
markably well the empirical PDF for the whole interval, while 
the g-exponential does not. Here, the Cramer-von Mises cri- 
terion is used for judging the goodness-of-fit of the probabil- 
ity distribution compared to a given distribution ll39ll . which is 
given by 



M 



[F{v) - F*{v)Y AF{v) 



(12) 



where F* is the empirical cumulative distribution function, and 
F is the corresponding theoretical distribution. In one-sainple 
applications, the function can be described as follows ll40ll4lll . 



p2 



1 

12^ 



i=l 



2i - 1 



2n 



(13) 



where n is the sample size. At the significance level 0.01, we 
find that Cfj for g-Gamma is smaller than the critical value 
using Eq. [T3] while the g-exponential's is larger. So we can 
accept the hypothesis that the trade sizes data ui come from q- 
Gamma distribution rather than g-exponential distribution. 
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Fig. 3. (Color online) Fitting the empirical PDF f{uj/{iL>)) of trade 
sizes of all the 22 stocks, which have already been normalized by the 
total number of outstanding shares. The dashed curve is the g-Gamma 
fit using Eq. with 6 = 0.07, /3 = 1.52, q = 1.22 and x = 0.57 
and the solid curve is the q-exponential fit using Eq. l[8j with = 0.7, 
q — 1.45 and x ~ 0.74. The corresponding tail exponents a are 2.03 
and 2.22 respectively. 



3.4 Determination of the tail exponent 

Calibrating the g-Gamma distribution already provides an es- 
timate of the tail exponent. However, a careful comparison of 
the fitted curve with the empirical curve in the tail of Fig. [5] 
indicates that this estimate might be biased. Here, we utilize 
six methods to estimate the tail exponents, including the least- 
squares estimator (LSE) based on a linear fit of a power law. 
Hill's estimator (HE) that is a conditional maximum likelihood 
estimator ll42ll . Meerschaert and Scheffer's estimator (MSE) 
based on the behavior of moments ll43ll . Clauset, Shalizi and 
Newman's estimator (CSNE) based on maximum likelihood 
methods and the Kolmogorov-Smirnov statistic ll44ll . Fraga Alves's 
estimator (FAE) that is a location invariant Hill-type estimator 
1I45I1 . and shift-optimized Hill estimator by Racz and Kertesz 
(RKE) that can handle data shifts |46], which is an extension 
of the CSNE. The resultant estimates of the tail exponent using 
LSE, HE, MSE, CSNE, FAE and RKE are 2.33 ± 0.08, 1.54 ± 
0.07, 1.75 ± 0.003, 2.23 ± 0.23, 2.59 ± 0.39 and 2.13 ± 0.23, 
respectively. As pointed out by Racz et al., the MSE is unable 
to predict exponent above 2, and systematically underestimates 
the tail exponent 1I22I1 . while the FAE results in a high variance 

a. 



4 Trading volume at different timescales 
4.1 Bulk distribution 

The distributions of trading volumes f2^t in different clock- 
time intervals At have been studied for several other stock mar- 
kets. The PDFs of the normalized trading volumes at different 
timescales are depicted in Fig. H] for the Chinese stocks. The 
timescale At ranges from 1 min to 240 min (1 trading day). 
The width of the PDF decreases with the increase of timescale. 
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Fig. 4. (Color online) (a) Probability density functions of the normalized trading volumes at different timescales At ranging from 1 min to 240 
min (1 trading day), (b) Fitting the PDFs using g-Gamma (solid curves). The timescales from top to bottom are At — I min, 5 min, 15 min and 
60 min, respectively. We have shifted the curves for clarity. 



In addition, there is no scaling in these PDFs. For small values 
of f^At, the density decreases with At. For large values of f2At , 
the PDFs decay in power-law forms. 

We also fitted these PDFs using g-Gamma, g-exponential 
and log-normal densities except the student density. Only the 
results of g-Gamma fit for four typical cases with At ^ I min, 
5 min, 15 min and 60 min are illustrated in Fig.HJb). It is clear 
that the g-Gamma functions fit remarkably the empirical PDFs, 
which is also reported for other markets with At = Imin, 2 min 
and 3 min ll24ll25ll26ll27ll . We use the Cramer- von Mises cri- 
terion again and find the values of C^j for g-Gamma function 
are smaller than the critical value at the significance level 0.01, 
when At ranges from 1 min to 240 min. 

Table 1. Calibration of the q-Gamma model to trading volumes at 
different clock timescales. 



At 


e 


P 




q 


X 


a 


1 min 


0.41 


0.15 


1 


29 


0.52 


2.30 


2 min 


0.32 


0.22 


1 


28 


0.52 


2.35 


3 min 


0.42 


0.01 


1 


29 


0.42 


2.44 


4 min 


0.37 


0.08 


1 


28 


0.44 


2.49 


5 min 


0.41 


0.04 


1 


28 


0.44 


2.53 


6 min 


0.40 


0.06 


1 


28 


0.42 


2.51 


10 min 


0.39 


0.11 


1 


27 


0.21 


2.59 


15 min 


0.31 


0.26 


1 


26 


0.29 


2.59 


30 min 


0.19 


0.57 


1 


26 


0.27 


2.28 


60 min 


0.12 


1.09 


1 


23 


0.30 


2.26 


1 20 min 


0.09 


1.51 


1 


22 


0.27 


2.04 


240 min 


0.07 


2.05 


1 


20 


0.31 


1.95 



In contrast, the g-exponential function and log-normal func- 
tion have significant deviations from the empirical distribu- 
tions, so we do not display them in Fig.|4jb). But g-exponential 
grasps well the tail behaviors for small At and log-normal grasps 
well for large values for large At. This is not surprising since 
the g-exponential (Eq.®) is a monotonically decreasing func- 
tion and close to power law for large values while empirical 
PDFs show symmetrically for large values for large At, tend- 
ing to log-normal distribution. However, we cannot distinguish 



the g-Gamma and g-exponential functions in the tails. The esti- 
mated parameters using nonlinear least-squares regression for 
all the PDFs are listed in Table [T] All but one tail exponents a 
are larger than two. 

4.2 Tail exponents 

In order to further confirm that the tail exponents of trading vol- 
umes at different timescales are consistently outside the Levy 
regime, we also adopt the six different estimators used in Sec- 
tion [3] for the tail exponents. The results are presented in Ta- 
ble [21 For comparison, we also show the a values obtained 
from the g-Gamma fitting. For all the six tail exponent estima- 
tors, the value of a trends up with the increase of At. All the tail 
exponents obtained based on the MSB are less than two, while 
CSNE, LSE, FAE and RKE give tail exponents larger than two. 
For the Hill estimator, a is less than two when At ^ 6 and 
greater than two when At ^ 10. Again, we argue that the tail 
exponents of the trading volumes at different timescales do not 
belong to the Levy regime. 

5 Trading volume distribution for multiple 
trades 

We now study the distributions of trading volumes fi^n for 
multiple transactions. The trading volumes are normalized by 
the averages for different Ari. Fig.|3a) illustrates the empirical 
PDFs for An = 2'= with A: = 0, 1, 2, • • • , 8. It is found that the 
PDFs have power-law tails. We use g-Gamma functions to fit 
the PDFs, since the other distribution has significant deviations 
from the empirical distribution of SI An - The estimated parame- 
ters for all the PDFs are listed in Table[3] Fig.|5lb) demonstrates 
the fits for four typical PDFs with An = 2, 8, 32, 128. The q- 
Gamma fits the data with good quality for small values of An. 
For larger values of An, the fits deviate considerably from the 
empirical data. Again, the Cramer-von-Mises tests is also used 
for judging the goodness-of-fit. The results are favorable only 
when An ^ 8 at the significance level 0.01. The value of C^j 
increases with the increase of An when An is larger than 8. 
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Table 2. Estimating tail exponents of trading volumes at different clock timescales. 



At 






HE 




MSE 




CSNE 






LSE 






FAE 






RKE 




1 min 


2.30 


1 


71 ± 


03 


1 7,c; + n OOfi 


2 


10 ± 


24 


2 


43 ± 


09 


2 


80 ± 


27 


2 


17 ± 


07 


2 inin 




1 

i. 


sm- n 

oo zn u 


U I 


1 7^ _L fi nny 

1 . ( U ZC U .UU ( 













UU 


9 


ijo zn u 


lU 


9 


zn u 


1 n 

±u 


3 min 


2.44 


1 


88 ±0 


10 


1.77 ±0.014 


2 


30 ±0 


42 


2 


50 ±0 


08 


2 


93 ±0 


14 


2 


21 ±0 


08 


4 min 


2.49 


1 


91 ±0 


07 


1.76 ±0.011 


2 


61 ±0 


25 


2 


58 ±0 


11 


2 


97 ±0 


25 


2 


41 ±0 


11 


5 min 


2.53 


1 


94 ±0 


09 


1.78 ±0.016 


2 


62 ±0 


24 


2 


56 ±0 


11 


3 


04 ±0 


19 


2 


20 ±0 


11 


6 min 


2.51 


1 


95 ±0 


10 


1.78 ±0.009 


2 


60 ±0 


34 


2 


71 ±0 


11 


3 


01 ±0 


24 


2 


49 ±0 


09 


10 min 


2.59 


2 


20 ±0 


10 


1.79 ±0.016 


2 


67±0 


36 


2 


81 ±0 


21 


3 


13 ±0 


35 


2 


55 ±0 


10 


15 min 


2.59 


2 


28 ±0 


11 


1.79 ±0.027 


2 


80 ±0 


49 


2 


58 ±0 


31 


3 


37 ±0 


36 


2 


76 ±0 


19 


30 min 


2.28 


2 


46 ±0 


22 


1.81 ±0.06 


3 


12 ±0 


52 


2 


78 ±0 


25 


3 


17 ±0 


33 


2 


66 ±0 


21 


60 min 


2.26 


2 


45 ±0 


28 


1.82 ±0.018 


3 


54 ±0 


62 


2 


40 ±0 


49 


3 


50 ±0 


55 


2 


81 ±0 


18 


1 20 min 


2.04 


2 


46 ±0 


29 


1.83 ±0.021 


3 


35 ±0 


53 


2 


47 ±0 


37 


3 


26 ±0 


50 


2 


86 ±0 


20 


240 min 


1.95 


2 


51 ±0 


24 


1.85 ±0.061 


4 


99 ±0 


71 


3 


57 ±0 


14 


3 


08 ±0 


64 


3 


00 ±0 


50 




So we can declare that the applicability of the (/-Gamma func- 
tions for multiple trades is restricted to the transaction number 
An ^ 8. 

Table 3. Calibration of the g-Gamma model to trading volumes at 
different event timescales. 



An 





P 




<? 


X 


a 


1 


0.07 


1.52 


1 


22 


0.57 


2.03 


2 


0.04 


2.68 


1 


17 


0.47 


2.20 


4 


0.07 


2.56 


1 


16 


0.26 


2.60 


8 


0.07 


3.28 


1 


14 


0.29 


2.86 


16 


0.09 


3.45 


1 


12 


0.45 


3.88 


32 


0.09 


3.37 


1 


12 


0.65 


3.96 


64 


0.12 


3.41 


1 


10 


0.87 


5.59 


128 


0.11 


3.39 


1 


10 


0.92 


5.61 


256 


0.15 


3.52 


1 


07 


1.04 


9.77 



In order to determine the tail exponents, the six estimators 
mentioned in the previous sections are utilized. The resultant 
tail exponents are presented in Table ID The a values obtained 
from LSE, CSNE, FAE and RKE are greater than two, while 
those obtained from HE and MSE are less than two for small 



An. There is an overall tendency that a obtained from different 
method increases with An. This can be considered as a signif- 
icance of the central limit theorem meaning that for An oo, 
the distribution converges to a Gaussian. The fact that the limit 
distribution is not a Levy stable one is another indication that 
a > 2. 



6 Conclusion 

The distributions of trade sizes and trading volumes are inves- 
tigated based on the limit order book data of 22 liquid Chinese 
stocks listed on the Shenzhen Stock Exchange in the whole 
year 2003. The size distribution of trades for individual stocks 
exhibits jumps, which is caused by the fact that traders prefer to 
place orders with the size being certain numbers. The empiri- 
cal PDFs of trading volumes at different timescales At ranging 
from 1 min to 240 min can be modeled by the g-Gamma func- 
tions. In contrast, the empirical PDFs of trading volumes for 
multiple trades can be fitted by the (/-Gamma functions only for 
small numbers of trades. An ^ 8. All the empirical PDFs ex- 
hibit power-law tails. In order to determine the tail exponents, 
we aclopted six estimators (HE, MSE, LSE, CSNE, FAE and 
RKE). The estimated tail exponents using LSE, CSNE, FAE 
and RKE are greater than two, while those obtained from HE 
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Table 4. Estimating tail exponents of trading volumes at different event timescales. 



An 


a 


HE 


MSE 


CSNE 


LSE 


FAE 


RKE 


1 


2.03 


1.54 ±0.07 


1.75 ±0.003 


2.23 ±0.23 


2.33 ±0.08 


2.59 ±0.39 


2.13 ±0.23 


2 


2.20 


1.75 ±0.02 


1.81 ±0.006 


2.24 ±0.34 


2.54 ±0.09 


2.95 ±0.22 


2.31 ±0.20 


4 


2.60 


2.07 ±0.02 


1.87 ±0.005 


2.21 ±0.07 


2.52 ±0.05 


3.10 ±0.52 


2.35 ±0.17 


8 


2.86 


2.32 ±0.03 


1.94 ±0.005 


2.63 ±0.16 


2.81 ±0.07 


3.09 ±0.68 


2.80 ±0.19 


16 


3.88 


2.49 ±0.04 


2.01 ±0.010 


2.80 ±0.13 


2.87 ±0.07 


3.03 ±0.62 


3.20 ±0.34 


32 


3.96 


2.93 ±0.07 


2.09 ±0.018 


2.97 ±0.20 


2.92 ±0.08 


3.12 ±0.46 


3.57 ±0.44 


64 


5.59 


3.36 ±0.15 


2.18±0.011 


3.33 ±0.22 


3.18±0.10 


3.30 ±0.45 


3.97 ±0.58 


128 


5.61 


3.97 ±0.43 


2.27 ±0.026 


3.84 ±0.23 


3.26 ±0.13 


3.65 ±0.57 


4.20 ±0.69 


256 


9.77 


4.37 ±0.64 


2.39 ±0.042 


4.51 ±0.47 


3.69 ±0.16 


4.05 ±0.68 


5.19 ±0.77 



and MSE are less than two when At or An is small. Since HE 
and MSE may underestimate the tail exponents, we conclude 
that the tail exponents of trade sizes and trading volumes of 
Chinese stocks are well outside the Levy regime. 

This work was partly supported by the National Natural Science Foun- 
dation of China (Nos. 70501011 and 70502007), the Fok Ying Tong 
Education Foundation (No. 101086), the Program for New Century 
Excellent Talents in University (No. NCET-07-0288), and the China 
Scholarship Council (No. 2008674017). 
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